Anchoring abstractions for near-optimal task and motion planning

نویسندگان

  • William Vega-Brown
  • Nicholas Roy
چکیده

We describe anchored angelic A* (AAA*), an algorithm for efficient and asymptotically near-optimal planning for systems governed by piecewise-analytic differential constraints. This class of systems includes many task and motion planning domains, such as quasi-static manipulation planning. AAA* uses an abstract representation of the problem domain to extract upper and lower bounds on the cost of concrete motion plans. It then uses those bounds to guide the search for a plan while maintaining performance guarantees. We demonstrate the use of AAA* on a simulated continuous Sokoban problem and show that it accelerates planning by an order of magnitude relative to our prior work (which used purely metric heuristics). We consider the problem of integrated task and motion planning through the lens of optimal motion planning under piecewise-analytic differential constraints. This formalism describes a broad family of task and motion planning problems, including navigation among movable obstacles and quasistatic manipulation planning. As a motivating example, we consider a toy problem, illustrated in fig. 1. A circular robot must push K circular objects into specified goal regions. The pose of each object is its position in the plane; the configuration space for the problem is then R. We neglect inertia and assume firstorder dynamics; the robot can move an object only by making contact and pushing, but it cannot pull an object or push multiple objects at once. This problem is the continuous generalization of the warehouse problem, also known as Sokoban. Continuous Sokoban and similar problems combine the challenges of non-convex optimization and combinatorial search, as the robot must decide both the sequence of objects to manipulate and where to move each object. Many authors [3, 9, 10, 12] have studied similar problems; nearly all research effort has concentrated on satisficing planning, where the objective is to return any plan that satisfies some set of objectives. Our focus is instead on optimizing planning, where the objective is to return the best plan that satisfies an objective function like path length or execution time. The few authors who have attempted optimal planning have either focused on restricted cases [4, 14] or local optimality [11]. In contrast, our previous work [13] described a globally asymptotically optimal planning algorithm called FORGG. The core idea of FORGG is to construct a set of random graphs on subsets of the configuration space—called modes— where the dynamics are analytic, which is a condition sufficient to ensure that a large enough random graph will contain Fig. 1. An example of the continuous sokoban problem. The robot (in green, labeled ’R’) must push the seven red, numbered objects to the goal locations marked with blue dots. a near-optimal plan. We can then glue these graphs together into a single planning graph covering the entire configuration space. By taking advantage of the product structure of the differential constraints, we are able to build this graph efficiently, even for large problems. However, the FORGG approach still relies on a best-first search to find the best plan in the graph, and that search is computationally intractable for large problems. In this abstract, we describe the anchored angelic A* (AAA*) search algorithm, which extends our prior work by using an abstraction to plan efficiently while preserving nearoptimality guarantees. There is a rich body of work on using abstraction to accelerate search [2, 5, 6, 7] in the context of satisficing planning, but little attention has been paid to optimizing planning with abstraction.ion to accelerate search [2, 5, 6, 7] in the context of satisficing planning, but little attention has been paid to optimizing planning with abstraction. Our approach combines the insights of the multi-heuristic A* (MHA*) algorithm developed by Aine et al. [1] and the angelic hierarchical planning framework developed by Marthi et al. [8]. The core idea of multi-heuristic A* is anchoring: by simultaneously performing searches with both admissible and inadmissible heuristics, and sharing information between the searches, MHA* can accelerate planning while still ensuring that the plan returned will be near-optimal. Although powerful, this idea is insufficient in task and motion planning problems like continuous Sokoban, as even crude heuristic estimates of the cost to reach a goal state are computationally expensive to compute. We therefore extend the MHA* algorithm to reason over abstract plans, such as “make contact with object 1 and push it to a goal state, then push object 2 to a goal state.” AAA* uses the angelic semantics introduced by Marthi et al. [8] to describe bounds on the cost of high-level actions. In many domains, determining the feasibility or cost of an abstract action is computationally hard. Angelic semantics rely on cheaply computed bounds on the cost or feasibility of higher level actions, rather than requiring exact feasibility checks. These bounds take the place of an inadmissible heuristic in our algorithm; in effect, AAA* simultaneously and incrementally computes the feasibility and minimal cost of abstract plans. The only exact cost and feasibility computations required are for primitive, local actions such as the path of a robot between nearby configurations. In general, the choice of abstraction and cost bounds are domain specific. We use a generic abstraction suitable for many object manipulation problem domains. We consider the set of monotonic plans, which are plans that touch each object at most once. We use the Euclidean length of each action in an abstract plan to lower-bound the cost of any concrete realization of the plan. In easy manipulation problems, when there are good monotonic plans, this allows for very fast planning. In harder problems, we can fall back on the admissible search to find an acceptable plan. AAA* combines the graph construction of FORGG with an abstract search anchored by our admissible heuristic, and is guaranteed to return a (1+ )-optimal plan for any specified > 0. We implemented AAA* in Python and performed qualitative comparisons with FORGG on several small instances of continuous Sokoban. In each case, we searched a graph with 100 poses per object, leading to graph sizes of approximately 10, 10, and 10 for problems with 1, 2, and 3 objects respectively. The parameters of AAA* were chosen to ensure a plan within a factor of 2 of the optimal plan. Results are presented in table I; we observe that AAA* is one to two orders of magnitude faster yet results in only a small decrease in solution quality. In particular, AAA* was able to solve problems with three or more instances, while FORGG failed to find a solution to a problem with more than two objects within twenty minutes of search time. These results are of course preliminary, and further evaluation is planned, both on larger problems and in additional domains. AAA* has strong theoretical properties, and although we have yet to conduct a complete experimental study, early results indicate that it offers a significant speedup over FORGG, the only other asymptotically optimal planning algorithm of which we are aware. Because it provides a way to take advantage of an incomplete abstraction or an inadmissible TABLE I PERFORMANCE COMPARISON Number of objects 1 2 3 Expansions FORGG 237 3197 — AAA* 115 753 12125 Time FORGG 3.30s 58.70s 1200 AAA* 1.38s 2.55s 30.58s Cost FORGG 27.78 45.42 — AAA* 35.64 51.87 163.96 heuristic, AAA* represents an promising avenue for bringing the many years of research on discrete planning and decision making into a new space of continuous planning problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Near-Minimum-Time Motion Planning of Manipulators along Specified Path

The large amount of computation necessary for obtaining time optimal solution for moving a manipulator on specified path has made it impossible to introduce an on line time optimal control algorithm. Most of this computational burden is due to calculation of switching points. In this paper a learning algorithm is proposed for finding the switching points. The method, which can be used for both ...

متن کامل

Optimal Trajectory Planning of a Box Transporter Mobile Robot

This paper aims to discuss the requirements of safe and smooth trajectory planning of transporter mobile robots to perform non-prehensile object manipulation task. In non-prehensile approach, the robot and the object must keep their grasp-less contact during manipulation task. To this end, dynamic grasp concept is employed for a box manipulation task and corresponding conditions are obtained an...

متن کامل

Trajectory Optimization of Cable Parallel Manipulators in Point-to-Point Motion

Planning robot trajectory is a complex task that plays a significant role in design and application of robots in task space. The problem is formulated as a trajectory optimization problem which is fundamentally a constrained nonlinear optimization problem. Open-loop optimal control method is proposed as an approach for trajectory optimization of cable parallel manipulator for a given two-end-po...

متن کامل

Geometric Abstractions of Vehicle Dynamical Models for Autonomous Intelligent Motion

Motion-planning for autonomous vehicles involves a two-level planning hierarchy: a high-level task-planning algorithm and a lower-level trajectory generation algorithm. The task planner operates on a discrete structure, such as a graph, whereas the trajectory generator operates on a dynamical system with a continuous state space. The problem of ensuring “compatibility” between these two planner...

متن کامل

Near Optimal Behavior via Approximate State Abstraction

The combinatorial explosion that plagues planning and reinforcement learning (RL) algorithms can be moderated using state abstraction. Prohibitively large task representations can be condensed such that essential information is preserved, and consequently, solutions are tractably computable. However, exact abstractions, which treat only fully-identical situations as equivalent, fail to present ...

متن کامل

Direct Optimal Motion Planning for Omni-directional Mobile Robots under Limitation on Velocity and Acceleration

This paper describes a low computational direct approach for optimal motion planning and obstacle avoidance of Omni-directional mobile robots within velocity and acceleration constraints on the robot motion. The main purpose of this problem is the minimization of a quadratic cost function while limitation on velocity and acceleration of robot is considered and collision with any obstacle in the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017